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Abstract. Broadcasting algorithms are important building blocks of distributed systems. 
In this work we investigate the typical performance of the classical and well-studied push 
model. Assume that initially one node in a given network holds some piece of information. 
In each round, every one of the informed nodes chooses independently a neighbor uniformly 
at random and transmits the message to it. 

In this paper we consider random networks where each vertex has degree d > 3, i.e., the 
underlying graph is drawn uniformly at random from the set of all d-regular graphs with n 
vertices. We show that with probability 1 — o(l) the push model broadcasts the message 
to all nodes within (1 + o(l))Cdlnn rounds, where 

c 1 

d ln(2(l-i)) dln(l-i)- 

Particularly, we can characterize precisely the effect of the node degree to the typical 
broadcast time of the push model. Moreover, we consider pseudo-random regular networks, 
where we assume that the degree of each node is very large. There we show that the 
broadcast time is (l + o(l))C Inn with probability 1 — o(l), where C = Ymid^oc Cd = jrg +!• 



1. Introduction 

1.1. Rumor Spreading and the Push Model. In this work we consider the classical 
and well-studied push model (or push protocol) for disseminating information in networks. 
Initially, one of the nodes obtains some piece of information. In each succeeding round, every 
node who has the information passes it another node, which it chooses independently and 
uniformly at random among its neighbors. The important question is: how many rounds 
are typically needed until all nodes are informed? 

The push model has been the topic of many theoretical works, and its performance was 
evaluated on several types of networks. In the case where the underlying network is the 
complete graph, Frieze and Grimmett [16] proved that with high probability (whp.) (i.e., 
with probability 1 — o(l)) the broadcasting is completed within (1 + o(l))(log 2 n + Inn) 
rounds, where n denotes the total number of nodes. Recently, this result was extended by 
the two authors and Huber |14j to the classical Erdds-Renyi graph G UtP , which is obtained 
by including each of the possible \Z) edges with probability p, independently of all other 
edges. Among other results, they showed that if p = oj(-^), then the typical broadcast time 
essentially coincides with the broadcast time on the complete graph. In other words, as long 
as the average degree of the underlying graph is significantly larger than Inn, the number 
of rounds needed is not affected. However, prior to this work, there was no result describing 
the performance of the push model on significantly sparser networks. 

The typical broadcast time of the push model was also investigated for other types of 
networks, albeit not as precisely. Feige et al. derived in [13] bounds that hold for arbitrary 
graphs. Moreover, they proved a logarithmic upper bound for the number of rounds needed 
to broadcast the information if the underlying network is a hypercube. This result was 
generalized by Elsasser and Sauerwald, who determined in [12] similar bounds for several 
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classes of Cayley graphs. Bradonjic et al. 0] considered random geometric graphs as under- 
lying networks, and proved that whp. the broadcast time is essentially proportional to the 
diameter of these graphs. 

1.2. Our Contribution. The main contribution of this paper is the precise analysis of the 
push model on sparse random networks. Note that in this context the study of the G n n 
distribution is not appropriate, as we would have to set p = c/n for some constant c > 0. 
However, for such p the random graph G ntP is typically not connected. In fact, if we took 
any p = o (^r), we would face the same problem, as such a p is below the connectivity 
threshold for G n ^ p (see for example |19j). 

A candidate class of random graphs that combines the feature of constant average degree 
with that of connectivity is the class of random <i-regular graphs G(n, d) for d > 3. It 
is well-known that a random <i-regular graph on n vertices is connected with probability 
1 — o(l). Thus, a typical member of this class of graphs is suitable for the analysis of the 
push protocol as far as the effect of density is concerned. Let T = T(G(n, d)) denote the 
broadcast time of the push model on G(n, d). Note that in this case the choice of the vertex 
where the information is placed initially does not matter. 

Theorem 1. With probability 1 — o(l) 



The above theorem is interpreted as follows: for almost all d-regular graphs on n vertices, 
with probability 1 — o(l) the push protocol broadcasts the information within the claimed 
number of rounds. It is easy to see that as d grows converges to ^ + 1> which is 
the constant factor of the broadcast time of the push protocol on the complete graph, as 
shown by Frieze and Grimmett [16]. Thus our result reveals the essential insensitivity of 
the performance of the push protocol regarding the density of the underlying network and 
shows that the crucial factor is the "uniformity" of its structure. 

We explore further this aspect and we consider regular graphs whose structural charac- 
teristics resemble those of a regular random graph. In particular, we consider expanding 
graphs whose "geometry" is determined by the spectrum of their adjacency matrix. 

In Subsection II. 3l below we give an intuitive description of the evolution of the randomized 
protocol, thus explaining also how do the two summands involved in come up. 

1.2.1. Regular expanding graphs. Expanding graphs have found numerous applications in 
modern theoretical computer science as well as in pure mathematics. Their properties to- 
gether with the theory of finite Markov chains have led to the solution of central problems 
such as the approximation of the volume of a convex body, approximate counting or the ap- 
proximate uniform sampling from a class of combinatorial objects. The latter applications 
have had further impact outside computer science such as in the field of statistical physics. 
We refer the reader to the excellent survey of Hoory et al. [T7] for a detailed exploration of 
the properties and the numerous applications of expanding graphs. 

The main feature of an expanding graph is that every set of vertices is connected to the 
rest of the graph by a large number of edges. This key property makes random walks on 
such graphs rapidly mixing and has led to the above mentioned applications. Moreover, 
this property makes expanding graphs an attractive candidate for communication networks. 
Intuitively, the high expansion of a graph implies that information that is initially located 
on a small part of the graph can be spread quickly on the rest of the graph. This becomes 
possible as the high expansion of a graph ensures the lack of "bottlenecks", that is, local 
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obstructions on which a broadcast protocol would need a significant amount of time in order 
to bypass them. 

We focus on a spectral characterization of expanding graphs, which is related to the 
spectral gap of their adjacency matrix. Let G = (V, E) be a connected d-regular graph and 
let A be its adjacency matrix. The Perron-Frobenius Theorem implies (see Proposition 2.10 
in [20]) that the largest eigenvalue of A equals d and that the corresponding eigenvector is 
proportional to the all-ones vector [1, . . . , 1] T . Let Ai, . . . , \ n be the eigenvalues of A ordered 
according to their value (note that since A is symmetric, these are all real). Set 

A := X(A) := max |Aj|. 

2<i<n 

If G has n vertices we say that G is an (n, d, A) graph. One can show (see for example p. 
19 in [20]) that A = £l(y/d). In particular, Alon and Boppana, Nilli |23] and Friedman [15] 
have shown that for every c?-regular graph on n vertices we have A2 > 2y/d — 1(1 — o(l)). 

We are interested in the class of cl-regular graphs for which A almost attains this lower 
bound. In particular, we are concerned with the broadcast time of the randomized pro- 
tocol on expanding ci-regular graphs on n vertices with A = 0(y/d). Such graphs can be 
explicitly constructed through number-theoretic or group theoretic methods (see the survey 
of Krivelevich and Sudakov [20] where numerous examples are presented). Informally, we 
show that if d = oo(y/n), then the broadcast time is essentially the broadcast time on the 
complete graph with n vertices. 

Theorem 2. Let G be a connected (n,d,X) graph with A < C\fd and d > 2Cv n In 1 / 9 n. 
Then for any v G V , with probability 1 — o(l) 

\T(G, v) — (log 2 n + Inn) I = o(m n). 

Again, this theorem shows the insensitivity of the broadcast time on the density of the 
underlying network. In fact, the assumption that A = 0(y/d) does not merely yield the high 
expansion of the graph, but it also implies that the edges of the graph are distributed in 
a uniform way among each subset of vertices. As we shall see in the proof of Theorem [2j 
this assumption implies that the structure of the graph is not very different from that of a 
random graph on n vertices and edge probability equal to d/n. For example, the number 
of edges between a subset S and its complement is close to ^|5|(n — \S\), which is the 
expected value in the random graph with edge probability d/n. In this sense, such graphs 
are pseudorandom. This notion was introduced by Thomason [26j and was explored further 
by Chung, Graham and Wilson [5], especially regarding its spectral characterization. 

1.3. The evolution of the randomized protocol in a nutshell. Roughly speaking, 
the evolution of the protocol consists of three phases, which have different characteristics 
regarding the rate in which the information is spread. 

Let us consider the first phase, which ends when there are at least en informed vertices, 
for some very small e > 0. Let us denote by It the set of informed vertices (i.e., those who 
possess the information) , and by lit the set of uninformed vertices at the beginning of round 
t + 1 of the push model. Moreover, let e be some edge that is incident to a vertex in It 
that has not been used up to now to transmit a message, and let St be the set of such edges. 
Then we show that the subgraph of G(n, d) induced by I t is essentially a tree, and moreover, 
that St contains ~ 2*(1 — 4)* edges. To see this, note that as every vertex informs some 
specific neighbor with probability 1/d, the expected number of edges from St that are going 
to be used is \St\/d. This means that ~ \St\/d new vertices are going to be informed (as the 
set of informed vertices induces a tree), implying that |£t+i| ~ \£t\ — \St\/d+ (d — l)\St\/d, 
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as for every vertex that becomes informed in this round the number of edges counted in St 
increases by d — 1. So, \£t+i\ ~ 2(1 — jj)|£t|. Note that in this calculation we worked only 
with expected values. In the actual proof we have to show that all the relevant quantities are 
sharply concentrated around their expectations. To this end, we use a variant of Talagrand's 
inequality by McDiarmid |22j (Theorem 0]), which has not been used very frequently in 
the analysis of distributed algorithms. We believe that it could be widely applicable to 
the analysis of existing or future randomized protocols with several different degrees of 
dependency. 

As soon as the number of informed vertices is > en, then after very few rounds the number 
of informed vertices is already (1 — e)n. Here it is essentially the expansion properties of 
G(n, d), which guarantee that every large set of vertices has linearly many neighbors and, 
thus, with high probability a certain fraction of those become informed in each round. 

During the final phase, the number of remaining uninformed vertices shrinks by a factor 
of (1 — Indeed, suppose that there are o{n) uninformed vertices. Then we expect that 
almost all of them have the property that the number of their neighbors in X t is d, implying 
that the probability that any one of the remains uninformed is precisely (1 — An easy 
calculation shows that a "typical" subset of G(n, d) has this property. However, the set of 
uninformed vertices might not be typical at all, implying that we need additional effort to 
guarantee the desired properties. 

2. Concentration inequalities 

In this section we will state two concentration inequalities that will serve as the backbone 
of our proofs. The first one is a Chernoff-type bound for sums of negatively correlated 
random variables, see e.g. [8]. 

Theorem 3. Let I\, . . . ,I n be a family of indicator random variables on a common prob- 
ability space, which are identically distributed and negatively correlated, i.e., E(Iilj) < 
E(Ii)K(Ij) for all 1 < i,j < n. Let X := £)" =1 Then, for any t > 

F(\X-E(X)\ > t) < 2exp ( —4 rr\ • 

Vl v n ! y \ 2(Epf)+f/3)/ 

The next concentration inequality that we will need is due to McDiarmid [22], and it 
is based on the work of Talagrand [25]. We give first a few necessary definitions. Let B 
be a finite set and let Sym(B) be the set of all permutations on B. Assume that ir is an 
element of Sym(B), drawn uniformly at random. Also, let X = (Xi, . . . ,X n ) be a finite 
family of independent random variables, where Xj takes values in a set Finally, set 
n = Sym(B) xn"=i%- 

Theorem 4. Let c and r be positive constants. Suppose that h : O — > R+ satisfies the 
following conditions. For each (a, x) G f2 we have 

• if x' differs from x in only one coordinate, then \h(o~, x) — h(o~, x')| < 2c; 

• if a' can be obtained from a by swapping two elements, then \h(o~, x) — h(o~',x)\ < c; 

• if h(o~,x) = s, then there is a set of at most rs coordinates such that h(o~',x.') > s 
for any (a',x') G Q that agrees with (a, x) on these coordinates. 

Let Z = h(ir, X) and let m be the median of Z . Then, for any t > 

F(|Z-H>t)<4exp(- i6rc2 ^ + t) ). 
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3. Properties of random regular graphs and the configuration model 

3.1. The configuration model. We perform the analysis of the randomized protocol us- 
ing the configuration model introduced by Bender and Canfield pQ and independently by 
Bollobas [2]. For n > 1 let V n := {1, ... ,n}. Also for those n for which dn is even, we 
let P := V n x [d]. We call the elements of P clones. A configuration is a perfect matching 
on P. If we project a configuration onto V n , then we obtain a d-regular multigraph on V n . 
Let G(n,d) denote the multigraph that is obtained by choosing the configuration on P uni- 
formly at random. It can be shown (see e.g. [191 p. 236]) that if we condition on G(n,d) 
being simple (i.e. it does not have loops or multiple edges), then this is distributed uniformly 
among all <i-regular graphs on V n . In other words, G(n,d) conditional on being simple has 
the same distribution as G(n,d). Moreover, Corollary 9.7 in [19] guarantees that 

(3.1) lim F(G(n,d) is simple) > 0. 

n— »oo 

(Of course the above limit is taken over those n for which dn is even.) Let A n be a subset of 
the set of d-regular multigraphs on V n . Altogether the above facts imply that if P(G(n, d) £ 
A n ) — y as n — y oo then also P(G(n, d) £ A n ) — > 0. This allows us to work with G(n, d) 
instead of G(n, d) itself. 

3.2. Some useful facts. We continue by introducing some notation. Let G be a graph, 
and let S, S' be subsets of its vertices. Then we denote by ec(S) the number of edges in G 
joining vertices only in S, and by ec(S, S') the number of edges in G joining a vertex in S 
to a vertex in S' . Moreover, we denote by Tc(v) the set of neighbors of a vertex v in G. 

Lemma 5. Let A, B C V n x [d] be two disjoint sets of clones, and let C C V n be a set of 
vertices such that (C x [d]) n (AuB) = 0. Let M be a matching drawn uniformly at random 
from the set of perfect matchings on the union of the clones in A,B and C x [d], and set 
N := \A\ + |B| + d|C| - 1. Then 

(3.2) E(e M (A))= ('f)^, E(e M (A,B)) = \A\\B\±, and M(e M (A,C)) = d\A\\C\± 

Moreover, let Hi denote the number of vertices in C that are adjacent to exactly £ clones 
in A in M, where < £ < d. Then, if \B\ > \A\ = w(lnn) 

Finally, let Q = X^>2^- Then, if N > 4 

(3.4) E(Q) < d 2 \A\ 2 \C\N~ 2 . 

Let X be any of eM(A),eM(A,B),eM(A,C) or Hi, and let /j, = ~E(X) . Then, if fi = uj (In 2 n), 
for any e = uj^i" 1 / 2 ) 

(3.5) F(\X - fj,\ > en) < 4e esp+iy^. 

Proof. Let e, e' be edges whose endpoints are in the union of the clones in A, B and C, and 
let L e ,L e > be the indicator variables for the events that e £ M and e' € M. As the number 
of matchings with e is equal to the number of matchings with e' we have E(/ e ) = E(/ e '). 
Hence, as ^2 e L e = ^ip- always, we infer that E(/ e ) = ^. By linearity of expectation this 
proves (I3.2p . 

To see (I3.4p let L e ^ e i be the event that both e and e' are in M. Note that if e n e' 7^ and 
also e / e', then E(/ eje /) = 0. Otherwise, let /, /' be any two edges satisfying /n /' = and 
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/ 7^ /'. Then, as the number of matchings with e, e! is equal to the number of matchings 
with /, /' we infer that E(J eje /) = K(Ifj/). As ^2 eJ L e i I e ,e' = always and as there 

are 3( Ar 4 ^ 1 ) ways to choose e, e' such that e n e' = and e / e' we obtain that 



, if e n e' / and e / e' 



i , if e = e' 



2 



, otherwise 



JV(JV-2) 

Let u = {ui, . . . , frf} be any vertex in C. Moreover, let now e, e' be distinct edges with one 
endpoint in A and the other in v, and note that there are ('"^') • \C\(^) ways to choose e 
and e'. If N > 4, then E(J eje /) < 4iV~ 2 , and this completes the proof of (f!T4"|) . 

To see (|3.3p let v € C and denote by L v the event that there is an edge in M connecting 
two clones of v. Moreover, let Hi(v) denote the event that v is adjacent to exactly £ clones 
in A. Then 

(3.6) V(H e (v)) = F(H e (v) nL^) + P(H e (v) \ L V )P(L V ). 

We estimate the above probabilities one by one. We shall begin with ¥(L V ). Note that there 
are at most d 2 choices for an edge that connects two clones of v, and that the probability 
that such an edge is in M is jj. Hence, 

(3.7) P(L V ) < d 2 N^ = o(ln _1 n). 

Next we estimate W(Hg(v) n L v ). Let us for the moment fix I clones c±,...,ci in A, and £ 
clones c[, . . . , of v. Note that there are ('y) choices for the Cj's and (fy choices for the c^'s. 
Then the number of matchings where the Cj's are matched to the c^'s, and no one of the 
remaining clones of v is matched to a clone in ^4, and there is no edge connecting two of the 
clones of v, is £\ • ( |B|+ ^ I_1) ) (d - t)\ ■ M ]A]m+m _ 2) , where M n = (n/2 ^„ /2 denotes the 
number of perfect matchings on n vertices. Stirling's formula yields the approximation 

(3.8) M n = (1 + e^" 1 )) • \/2n re/ V re/2 . 
Moreover, our assumption \B\ > \ A\ = w(lnn) implies that 



Vlnn/y £\ \ d-l ) \ \lnnJ J (d-£) 

All the above facts together yield that 

V{Hi{v) nL- v )=(l + o(^-)).(f] \Af(N - \A\r* MN+1 - 2d 



AnnJ J \£J M N 
By applying the estimate for M n we infer that the last fraction equals 

, N+l-2d 

1 x\ (N + l-2d)^- / , / 1 



d-l 



So, 

( , 9) m^-^MVffl'i 1 -®) 

Finally, we estimate ¥(Hg(v) \ L v ). Note that the event He(v), given L v , implies that there 
are £ clones of v that are matched to some clones in A. By a similar reasoning as above we 
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infer t licit 

Note that our assumption \B\ > \A\ implies that ^ < i ^j+jgi < \. So, 1 — ^ > \, and (13.7 
together with (|3.6p imply that 



Inn 



By plugging this into (j3.6[) we thus complete the proof of (|3.3p . 

We finally prove the concentration of X by applying Theorem 2] as follows. We will first 
specify the families X and ix. Here, X = 0. The random permutation tt corresponds to the 
random perfect matching on the union of the vertices in A, B and C. More precisely, assuming 
that this union consists of 2k clones, which are labeled 1, . . . , 2k, we consider a uniformly 
random permutation of these clones tt := (i\i2 ■ ■ ■ «2fc-i^2fc)- Then we match the clones that 
are in consecutive pairs, that is, we choose the matching {(11,12), (h,n), ■ ■ ■ > (*2fc-lj i>2k)}- 
This is a uniform perfect matching on these clones. Note that the pair (X, tt) determines 
the value of X. Moreover, 

• if we swap two elements of tt, then X can change by at most 2; 

• if X = £, then we need to specify at most dl elements of tt in order to certify this. 
Thus, we may take c = 2 and r = d in Theorem UJ Moreover, let Mx be the median of X. 
An easy calculation shows that \M X - E(X)| = 0(y/E(X)) (cf. Example 2.33 in [19]). The 
proof completes by applying Theorem [4] with, say, t = l.leMx- □ 



4. Analysis of the Randomized Broadcasting Algorithm 

4.1. The preliminary phase. Let Tq be the first round in which the number of informed 
vertices exceeds ln 7 n. We will show the following statement; it is not best possible, but it 
suffices for our purposes. 

Lemma 6. With probability 1 — o(l) we have that To = O(lnlnn). Moreover, for sufficiently 
large n the subgraph induced by the vertices in It is with probability 1 — o(l) a tree. 

Proof. Let T>i denote the number of vertices at distance i from vertex 1 . We will first show 
that whp. we have \T>i\ = d(d — l) l_1 for all 1 < i < y/lnn, which implies that the subgraph 

induced by VJ^^ n T>i is whp. a tree. To see the claim, we work in the configuration model 
and expose the sets T>i one after the other, i.e., we first expose the edges in the random 
matching that contain the clones of vertex 1, then the edges that contain the (remaining) 
clones of the vertices in T>\, and so on. 

Suppose that = d(d — 1) 4_1 for all i < j < \/lnn. This implies that all edges in the 
matching incident to the clones corresponding to the vertices in T>±,.. . 1 are exposed. 
Moreover, for every vertex in T>j there is precisely one clone whose neighbor is exposed, 
and for all other d — 1 it is not. Let us denote by J-j this set of unexposed clones. We 
have \J-j\ = d(d—iy, and let us note for future reference that with room to spare \J-j\ < n 1 / 3 . 

Clearly, "Dj+i consists of all vertices in C = V n \ (T>i U ■ ■ ■ U T>j) for which at least one of 
their clones is connected in the matching to some clone in Tj. Let Q denote the number of 
such vertices with the property that they are matched to at least two clones in JFj, and let 
M be a random perfect matching on the union of the clones in Tj and C . Then 

l^j'+il = \Fj\- 2 e M{Fj) ~ Q- 
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By applying Lemma [5] with A = J-j, B = and C as above we obtain for large n that 
E(e M (^)) < n _1/3 and E(Q) < 2d 2 ^ 1 / 3 . 

So, with probability at least 1 — 3d 2 n -1 / 3 we have that emi^Fj) = Q = 0. The proof of the 
claim completes by applying the above argument for i = 1 . . . y/lnn. 

With the above fact we can prove the lemma as follows. Let v be a vertex in Pj, for 
some 1 < i < 10 In In n := I, and denote by T v the time until v gets informed. Let v' be 
the unique neighbor of v in Then T„ = T„/ + where X vv i is a geometrically 

distributed random variable with success probability d . Moreover, X vv > is independent 
of T v >. In other words, we have that T v = X^j=i^i' where the X^s are iid. variables as 
above. So, E(T„) = di, and by Theorem [3] 

r> n 1 (19di) 2 ... 

P(T„ > 20d 2 i) = ¥(Bin(20d 2 i,d~ 1 ) < i) < 2e~^o3r < 2e _4d \ 

In particular, for i = £, this probability is at most 21n -4M n. Moreover, the total number of 
vertices in u| =1 Pj is at most d d d ^ < ln 2M n. So, by Markov's inequality, there is no vertex 
at distance at most I from vertex 1 that will not be informed in the first 20d 2 £ = O (In Inn) 
rounds. Moreover, d-j^j = n )-, and the proof is completed. □ 



4.2. The Exposure Strategy. In this section we will describe our general strategy for 
determining the probable broadcast time of the randomized rumor spreading protocol. We 
will denote by X t the set of informed vertices and by lit the set consisting of the uninformed 
vertices, i.e., Ut = [n] \ It, at the beginning of round t. We have that X\ = {1}. We can 
simulate the execution of the rumor spreading protocol as follows in two steps. First, we 
choose one of the clones of vertex 1 uniformly at random, say c\. Then, we expose the 
edge in the random matching whose one endpoint is c±, and pass the message to the other 
endpoint, say C2- Note that this is equivalent to selecting uniformly at random a clone d 
different from c\, and joining c\ and d by an edge. Clearly, C2 is a clone that corresponds 
to some vertex in the original graph, which now becomes informed. This completes the first 
round, and X2 consists of vertex 1 and the vertex corresponding to c%. 

This gradual exposure of the graph can be generalized to any other round in the following 
manner. Suppose that we are in the beginning of round t + 1 > 0. We will simulate the 
execution of the protocol as follows in two steps. 

Step 1. For each v £ It we choose one of its clones uniformly at random, independently for 
every such vertex. We shall denote the selected clone by c v = c v (t). 

Step 2. Set It+i = It and let v G If If Cv belongs to an edge in the random matching that 
was exposed in one of the previous rounds, do nothing. Otherwise, choose uniformly 
at random one of the remaining unmatched clones, say c, and connect it to c v by an 
edge. Add the vertex corresponding to c to It+i, if it isn't already contained in It+i- 

If a clone of a vertex in Ut is matched to c v , for some v £ It, then that vertex becomes 
informed - we denote by Mt+\ the set of those vertices. In short, Aft+i is the set of newly 
informed vertices in the t + 1st round. Let us introduce some further notation regarding the 
two exposure steps. At the beginning of round t + 1, we denote by Vt the set of clones of the 
vertices in It whose neighbors have not been exposed yet (i.e., in none of the previous rounds 
the edges in the matching containing those clones were exposed). Among those, during Step 1 
we choose a set At+i Q Vt of clones. Informally, At+i contains the clones through which new 
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vertices might get informed. Finally, we write N t +± = \Aft+i\, A t +i = |*4t+i| and Pt = \Vt 
and note that Vq consists of the d clones of vertex 1. 



The two steps of our exposure strategy can be also viewed as follows. In the first step we 
choose according to the rule described above a random subset At+i of Vt- Then, in Step 2, 
the clones in At+i are matched to the union of the clones in Vt and the clones corresponding 
to the vertices in Lit (as, per definition, all other clones are already matched). In other words, 
we consider a random perfect matching Ait+i on the set of clones in Vt and Lit, and we will 
study its combinatorial properties. In particular, the following claim relates the random 
quantities in question. 

Proposition 7. Let Ha+\ denote the number of vertices in Lit that were informed i times 
in round t + 1, i.e., a vertex v is counted in Hn+i, if there are i clones in At+i that are 
matched to the clones of v in M.t+1- Then 

d 

(4.1) N t+ i = ^2H itt+1 <e Mt+1 (At +1 ,Ut), 

i=l 

(4.2) I t+1 = It + Nt+i and U t+1 = U t - N t+1 , 

d 

(4.3) P t+ i = P t - A t+ i-e Mt+1 {A t+1 ,Vt\At+i) + ^{d-i)H ijt +i. 

i=l 

Proof. The first equality in Equation (I4.ip follows directly from the definition of H^t+i, as 
every vertex in Lit has d unmatched clones, and it becomes informed as soon as at least one 
of them gets matched in Step 2 to a clone in At+i- The upper bound is also easy to see, as 
the number of newly informed vertices is at most the number of edges in A4t+i that have 
one endpoint in At+i and the other in the set of clones corresponding to the vertices in Ut- 
Equation (I4.2p follows immediately from the definition of Xt and Nt+i- Finally, to see (|4.3p . 
note first that all clones in At+i are excluded from Vt+i, as they are matched to other clones 
in Vt or Ut, this accounts for the "— ^4j+i" term. Moreover, all clones in Vt \ At+i that are 
contained in edges of Ait+i with the other endpoint in At+i are excluded from Vt+i as well, 
as the edge including them was exposed; this accounts for the the "— e^ 1+1 {At, Vt \ At+i) n 
term. Finally, for each newly informed vertex counted in Hij+i, i-e., which was informed i 
times in round t, the number of clones counted in Vt increases by d — i. □ 



For future reference we prove already here a lemma that addresses the concentration 
properties of A t +\. 



Lemma 8. For any t > 1 and n > 5 



A '-i 



> 



Pi 



din n 



P t \ <2e 3d In* n . 



Proof. For each clone c E Vt let I c be the indicator variable for the event that c is selected 
in the first step of the tth round, i.e., "7 C = 1" iff the random decisions in Step 1 are such 
that c 6 At+i ■ Since each clone has probability 1/d to be selected we have E(/ c ) = 1/d. 
Moreover, for two distinct clones c, c' we have that 



10 , if c, d are clones belonging to the same v G V n 1 , 
Wd) = 1/d 2 , otherwise " ffi = E{Ic)m 
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i.e., the 7 c 's are negatively correlated. We infer that \i := E(j4 t+ i | P t ) = and Theorem[3] 
implies that the sought probability is at most 

W(\A t+1 -fi\ > ^/ln 2 n | P t ) < 2exp ( ^ ^ < 2exp ( . 

K\ t+i — 1 tJ ~ y \ 2(/i + ^/(31n 2 n))y " y \ 3 In 4 n J 

□ 



4.3. The Middle Phases. Let T\ be the first round where the number of informed vertices 
is at least n — In 7 n, or equivalently, where Ut x < In 7 n. The main accomplishment of this 
section is the proof of the following lemma, which describes the likely evolution of the number 
of (un)informed vertices and of Pt until t = T\. 

Lemma 9. Suppose that Pt,Ut > ln 7 n. Abbreviate Ft = 1 — d ^ P ^ dUt ^ ■ Then, uniformly 
with probability at least 1 — o(jj-), 

(4.4) P 1+1 = (r- (±.)y^(l-^F t .P t + dUt{ F t -Ff ) y 

(4.5) t/ t+1 = (l- (J-)).^.^ 

Proof. Let Hi t+i denote the number of vertices in lit that were informed i times in round 
t + 1, and recall that Proposition [7] describes the relation of the quantities Pt+i and Ut+i 
to Pt,Ut and Hn+i- We will show that uniformly for all i such that Pt,Ut > ln 7 n, with 
probability 1 — o(pj— ) we have 

(4.6) A t+1 =[l + o( 



Inn/ / <i 



(4.7) eM (At + i,r t \A t+ i)=[l + o{-L)) (l-~)P t (l-F t )±ln 5 n, 



and 

) ^.Mt4 

\ vlnn// \ « 
and that for all 1 < i < d 

(4.8) = (l + ( J-) ) LT t • Q (1 - F t )*Ft* ± In 5 n. 

This proves (|4.4p and (|4.5p as follows. First, by using (|4.ip we infer that with probability 
1 — o(j— ) the number of informed vertices in round t + 1 is 

N t+1 = £ fli >H .! = (l + tft • (1 - if) ± din 5 n. 

i=l ^ nn / 

So, as C/f > ln 7 n, with probability 1 — o(jAr) the number of uninformed vertices at the end 
of round t + 1 is 

U t+1 = U t - N t+l = U t -(l + o(j^)) ft " (1 - ± dln 5 n = M + o(^)) ^t- 



This shows (|4.5p . To see (|4.4p recall (j4.3|) and note that with probability 1 — o(j^) 

^(d-i)-Hi,t+i = ( 1 + °(f n ^)) EWt-i?)±dln°n 
i=l ^ nn / 



Inn ' 

5 
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Hence, by applying (|4.3[) we infer that with probability 1 — o(j-J-) 

d 

P t +x = P t - A t+l - e Mt+1 (A t ,V t \ A t ) + ^2(d - i)H i>t+1 



and this shows (|4.4p . 

It remains to prove (|4.6p - (|4.8p . We start with (|4.6p . This is easily seen to hold, by 
applying Lemma [S] and using the fact that Pt > ln 7 n. To see (|4.7p we apply Lemma [5] with 
A = At+i,B = V t \ At+i and C = U t . We infer that 

mf (A t> \ a w A t+ i(P t - A t+ i) 

H := E(e Mt+ i{At+i,Vt \ At+i)) = p t + dUt _ 1 ■ 

Note that for sufficiently large n we have with probability 1— o(j^-) that |7 ? t\.At+i| > |^4t+i|, 
and that At+x = oj(lnn). By using (|4.6p and the definition F t = 1— we thus obtain 

"-Me;)) - (> + <±)) (> 4) «<' - «>■ 

If /i > In 3 ?i, then by applying (|3.5p with e = In" 1 ' 1 n we infer that 

meM t+1 (A t+ i,T t \ At+i) -n\> //hr 1 - 1 n) = f^ln" 1 n). 
On the other hand, if u < In 3 n, we obtain by Markov's inequality that 

V(e Mt+1 (A t+ i,Vt\ At+x) > ln 5 n) = oQn^n). 
By combining the above statements we infer that with probability at least 1 — o(jj^) we 

have that eM t+1 (At+i,Vt \ At+x) = (1 + °(hT^))A i ± ln5 n i- e -i OZZD is proved. 

The proof of (|4.8|) is very similar. By applying Lemma with A = At+x,& = V t \ At+x 
and C = tit we infer that 

n : =mii+l)= ( 1 + o(±))u,.( d \(^+^) i (l A ' 



Ann J J \ij \P t + dUt-lJ \ P t + dU t - 1 
As with probability 1 — o(j-^-) we have At+x = (1 + °(hTn))ir we infer that 

»i=(l + o(^))u t .( d )Ft(l-F t y-\ 



■■Inn J J \i / 

The proof now completes with a case distinction as above, i.e., we treat the case \l\ < In 5 n 
with Markov's inequality and the case \Xi > ln 5 n by using (|3.5p . □ 



Lemma [9] allows us now to derive probable bounds for T\ . 
Corollary 10. With probability 1 — o(l) we have that T\ — Tq = C^hxn + O(lnlnn), where 

r 1 1 



ln(2(l-i)) dln(l-a)" 

Proof. By applying Lemma [6] we infer that at round To with high probability there are for 
the first time at least In n informed vertices, and the set of informed vertices induces a tree. 
Hence, we may assume that 

In 7 n < I To < 2 In 7 n and (d - 1)I Tq < Pt < dI To ■ 
We will use those facts in the sequel without further reference. 
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Let pt and ut be given by the recursions 

Pt+l = U - ^j) ftPt + dut{ft - ff) and u t+ i = ffu t , 

where ft = 1 — ^(pt^dut) > anc ^ Pt = fr > ^T = n — It ■ As we are interested in the probable 
values of Pt and for i = O(lnn) we infer by applying Lemma [9] that pt = (1 + o(l))Pj and 
ut = (1 + o(l))C/j for all such i, provided that Ut, Pt > hi 7 n. In the sequel we shall therefore 
consider only the evolution of pt and Ut ■ 

Let q := 2 (l — 2) , £ = 0.01 and ti be the minimal t such that g t_T ° < p^-. We will first 
show that for all To < t < t\ 

(4.9) Pt<Pr -^" T ° and > P To ■ q^ T ° - 3P| Q • q 2 ^^ /n, 

and 



(4.10) u t = n- I To - Pr o q -^-^ ± 9 • P| g 2 ^)/ 



We proceed by induction on t. Note that for t = Tq the statement trivially holds. In order 
to perform the induction step (t — > t + 1) we will need some facts. First, let x = 1 — ft and 
note that 

/ ,-j-_(i-,)-(i-,)^( i -i),_ (i -i) 55 -S 5 - J <^le. 

So, we readily obtain the upper bound for in (|4.9p by using the the recursion for pt as 
follows. 

Pt+i < (l - ~) /tPt + di*t • " 2 ( X " 5) p * = m ^ Pm " Pt ° ' 1 t+1 ~ T °- 

To see the lower bound for pt, note first that the induction hypothesis, together with the 
fact that j/*" 7 " < t^t- imply that — < 1. Thus, r-4- > 1 — -p-. A similar calculation as 

^ — In' n 1 J «t ' — dut 

above and by using the fact (1 — x) d < 1 — dx + (^}x 2 for x > reveals that 



2y ~ d p t + du t 2 d?{p t + du t ) 2 

d-l p t p\ d - 1 pt 3p 2 



> — ^ ^ = > 



d2 u *( 1 + lir) 2d 2 u 2 " d 2 u< 2d 2 u 2 ' 
By using again the recursion for pt we infer that 

*>fi X \ t ^ fd-lpt 3p 2 \ 2p 2 
Pt+1 -V'-d) ftPt + dUt • {—u t -2d^)- m ~du t - 

Note that the induction hypothesis and the fact q 1 ^ < imply that u t > n/2. So, 

a op2 _2(t-To)+l A 

n \q «<?/ n 

This proves the lower bound for p t in (|4.9p . Next we prove the bounds for Ut+\. Note that 

ut+i (-. Pt \ d . Pt Pt . Pt 



V ^(Pi + / + du t du t d 
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A similar calculation using the fact (1 — x) d < 1 — dx + i^)x 2 for x > reveals that 

u t+i < x _ Pt + ( d \ Pt < i Pt | 3p t 

u t ~ p t + dut \2J d 2 (p t + du t ) 2 ~ du t 4u 2 ' 

Recall that the induction hypothesis guarantees Uf > n/2. The above facts together with 
the bounds for pt imply after a straightforward but lengthy calculation (j4. 10p . We omit the 
details. 

The above discussion settles the growth of pt and ut up to the time t\. Note that t± = 
ln(2(l — Inn + (In Inn). In order to deal with t > t\ let us first make two important 
observations. First, note that at t\ we have that 

(4.11) ^ = fi(l). 

Let us next consider the ratio r t := p t /u t . Note that f t = 1 - d ^ dut) = 1 - d {i+d/r t ) - Tlie 
recursions for pt and Ut imply that 

r w = (i - 1) fr d+1 n + d( ft - d+1 - 1) =* r -f = (i - /r d+1 + ^(/r d+1 - 1). 

Consider the function 

1 d\A 1 V d+1 d 



g(x) =!-- + _ 



d x y \ d{l + d/x)J x 



rt 

In the sequel we will argue that g is monotone increasing. This implies > g(0) > |, and 



and note that = g(rt). A straightforward calculation shows that lim^-s.o ff^) = 2(1 — ^). 
In the sequel we will argui 
so we have for any t' > 

(4.12) r t+t > > r t ( ^ ) p t+t , > ( ^ ) u t+t ,. 



4\'' /4^' 



This fact will become very useful later on. To see why g is increasing, note that 
✓ (,) _ =3*^Lt* where T-(l 1 



x 2 + xd V d(l + d/x) 

Suppose that there is an xq > such that g'(xo) = 0. Then — T + d 2 = xq(T — d). However, 
we always have 1 < T < d. Thus, the right-hand side of the above equation is < 0, while 
the left-hand side is > 0. We infer that there is no such xq, and therefore the sign of g'{x) 
equals the sign of </(0). As the latter is easily seen to be positive, this concludes the proof 
of the monotonicity of g. 

Let t2 be the minimal t such that p t2 > ut 2 In 2 n. The Equations (|4.11|) and (|4,12|) 
guarantee that ti = t± O(lnlnn), and moreover that for any t > ti such that ut > we 
have pt > ut In 2 n > 1. Under these conditions note that 

Thus, for any t such that t = t2 + O(lnn) we have that 

ut = (l + (l)). (1--J u t2 . 
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Recall that T\ is the first t such that Ut ± < In n. As ut 2 < n, we readily obtain that T\ < 
t\ + 0(ln In n) — * — In n = C<2 In n + 0(ln In n). To see the corresponding lower bound 

a ln((l— -g )) 

for T\, note that as long as pt > 1 we always have 

u t+1 > (l - 1) u 4 . 

The proof completes with the fact it^ = 0(n). □ 



4.4. The Final Phase. Let T\ be the first time such that the number of uninformed vertices 
drops below ln 7 n. In the previous section we argued that T\ = Cd In n + O (In Inn), where 
is given in CorollaryllOl The main aim of this section is to prove that the broadcasting of the 
message completes after additional 0((lnlnn) 2 ) rounds. This is shown in the next lemma. 

Lemma 11. With probability 1 — o(l) we have T — T\ = 0((lnlnn) 2 ). 

Proof. Before we show the claim let us prove a auxiliary fact. Let S be any subset of the 
vertices of G(n, d) of size at most ln 7 n. We will show that with probability 1 — o(l) 

e(S) < 1.1\S\. 

To see the claim, suppose that there is an S such that e(S) > 1.1s, where we set s = \S\. 
There are (™) < choices for the set S. Moreover, there are at most s 2 ' 2s ways to choose 

1.1s edges in S. Finally, the probability that the chosen edges are in G(n, d) is M ^~ 2,2s , 
where M x denotes the number of perfect matchings on x vertices. Using (I3.8P we infer that 

P(3S : e, S , >!.!,„, S (1 + *„(=)• ■ ^ • ^^0^1 

< (l + o(l))(e 2 ' 1 s L2 n) s - (dn)- Lls . 

This expression is ra"^ 1 ) for any s < In 7 n; this concludes the proof of the auxiliary claim. 
In particular, G(n, d) is such that any set S of at most In 7 n vertices satisfies with room to 
spare 

e{S, V n \ S) > (d - 2.2)s > ds/A. 

With this fact at hand it is routine to complete the proof of the lemma. Indeed, let S 
be the set of uninformed vertices at some point in time after Ti. So, \S\ < ln 7 n. As 
e(S,V n \ S) > ds/A, we know that at least s/4 vertices in 5 have at least one neighbor in 
V n \ S. More precisely, there is a set S 1 C S such that > s/4 and for all v E S 1 there is 
at least one v' E V n \ S such that v and v' are joined by an edge. 

Denote by B the event that after 10 In In n rounds there is a v E S' that was not informed 
by v 1 . The probability for this event is at most 

, l\ 101nl ™ (m < ln v n) _ x 
\S | • I 1 — - 1 = o(m n). 

So, after 10 In Inn rounds the new set of uninformed vertices has size at most |S'\S"| < %\S\. 
Iterating the above argument O(lnlnn) times finally completes the proof. □ 
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5. Randomized broadcasting on expanding graphs: proof of Theorem [2] 

In this section we prove Theorem [21 thus bounding the broadcast time on connected 
(n, d, A) graphs with A = 0{-\fd). In order to avoid any confusion we stress that this condition 
is interpreted as follows: there is a C > such that for any n sufficiently large A < Cyd. 

We will use the main result from [14] , Before we state it, let us first introduce the notion 
of a (p,e)-typical graph. A graph G = G(V,E) on n vertices is called (p, e)-typical, if the 
following three conditions are satisfied: 

• For any S C V with \S\ > e 2 n, there is a set X s C V \ S with \X S \ < ^ such that 

Vv€(y\S)\X s :d s (v) = (l±e)p\S\. 

• For any S C V with \S\ < e 2 n, there is a set Xs C V \ S with \Xs\ < s\S\ such that 

Vw G (V \ S) \ X s : < epn. 

• For all 5 C V we have 

e(S,V\S) = \S\(n-\S\)p(l±8^). 
The following appears in |14| . 

Lemma 12. Let e = e(n) 6e a positive real-valued function such that e(n) — > 0, as n — > oo, 
6ui e > In" 1 / 9 /!. Let p > p-^- If G is a (p,e) -typical graph and v G V, £/ien 
probability 1 — o(l) 

|T(G,u) - (log 2 n + lnn)| < 3e 1/3 Inn. 

We will show that an (n, d, A) graph is (p, e)-typical with p = d/n and e > In 1 / 9 ?!. In 
particular, we will prove the first two conditions by sampling uniformly at random a vertex 
in V, and then showing with Chebyschev's inequality that its degree in a given set S is 
concentrated around its expected value which, as we shall see, equals d\S\/n. 

Let A be the adjacency matrix of G and let e%, . . . , e n be an orthonormal basis of M. n 
consisting of the eigenvectors of A, ordered according to the moduli of the corresponding 
eigenvalues Ai, . . . , A n . Since G is d-regular and connected, we have e\ := ^=[1, • • • , 1] T (cf. 
Proposition 2.10 in |20j ) and the corresponding eigenvalue is d. For the sake of notational 
convenience, we will fix an ordering on V, namely v\, . . . ,v n and we will assume that the 
ith entry of each vector corresponds to V{. 

Let S be an arbitrary subset of V and let xs be the characteristic vector of S, that 
is, the vector indexed by V where the elements corresponding to the vertices of S are 
equal to 1 and the remaining ones are equal to 0. We set ds := A\s and note that 
ds = [d s (vi),...,d s (v n )} T . 

Let v be a vertex in V chosen uniformly at random. Thus K(ds(v)) = y t ^2 u ^v ds( u )- 
Note that this sum is just ; where (•, •) denotes the usual dot product in R n . On the 

other hand, we can express ds = A\s also by taking the expansion of xs with respect to 
the basis e\, . . . , e n and then multiplying by A. Note that {xs, = „ [lj • • • > 1] T - Thus 

Xs= l -^[l,...,l] T + ^2{xs,e i )e l . 

i>2 

Therefore 

Ax* = ^[l,...,l] T + 2><XS,ei>fii. 

i>2 
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Since e\ is orthogonal to the vectors e2, . . . , e n , we have 

(dfif,ei) = (Axs,ei) = ^—y/n, 

implying that 

d:=K(d s (v)) = ^. 

n 

In the following we will bound the variance of ds(v). Write Vax(ds(v)) = D/n, where 
D '■= T l uev d s( u ) ~ n <P- But Enev d s( u ) = \\ d s\\ 2 - B y Pythagoras' Theorem 



||d S || 2 = £(d S , e 4 > 2 = nd 2 + £(d 5 , ei) 2 . 

i=l i=2 

Therefore, D = ^"^(ds', ej) 2 . To bound the latter sum note that 

n n n n 

£<d s , e t ) 2 = Y,( A XS, ei} 2 = £> s , Ae % ) 2 = £ X 2 ( X s, e*} 2 

i=2 i=2 i=2 i=2 

< A 2 J2( XS , e % ) 2 = A 2 (||X5|| 2 - (XS, ei) 2 ) = A 2 (\S\ - ^) = A 2 |5| (l - 



i=2 

Thus 

£<a»M/'i_!3 



Var(d s (i>)) 

n n \ n 

Now we are ready to derive the first two conditions of the definition of (d/n, e) -typicality, 
when A < C\fd. 

• Let S be such that \S\ > e 2 n. Then the size of X$ is bounded from above by 
n¥(\ds(v) — d\ > ed). We bound this probability with Chebyschev's inequality. 
Indeed, 

/ 1 j / \ t, y, Vax(d s (v)) X 2 \S\/n A 2 nC 2 C 2 

By the choice of e, the above bound is at most 8/ Inn and therefore \X$\ < 8n/lnn. 

• Now let \S\ < e 2 n. Thus d < de 2 . Here the size of X$ is bounded from above by 
n¥(ds(v) > de). Since for n large enough de — de 2 > de/2, this probability is at most 
¥(ds(v) — d > de/2). Again, Chebyschev's inequality implies 

/ 7 / \ t i / \ 4Var(d s (v)) 4A 2 |5|/n 4A 2 e 2 4A 2 4C 2 

Thus < j U - We want to deduce that this is at most e\S\. We may assume that 
|5| > de, as otherwise what we are aiming at holds trivially. So, it suffices to deduce 

that < de. But this holds by our assumption d > V 4C 2 nln 1 / 9 ra > V^ 2 ~e~hi. 

The third condition in the definition of (d/n, e)-typicality is a standard property of (n, d, A) 
graphs. 

Theorem 13 (Theorem 2.11 in [20]). Let G = G(V,E) be an (n,d,X) graph. Then for any 
two subsets U, W C V we have 



e(U,W)- d MW 



n 



n I \ n 
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We set U = S and W = V \ S. Then the above implies 
d\S\(n-\S\) 



e(S,V\S) 



n 



<XJ\S\{n-\S\)[l 



n l \ n 



_ X\S\(n- \S\) _ A d\S\(n-\S\) C_ d\S\(n-\S\) 
n d n ~ yfd n 

Since d > V4C 2 e _1 n, we have -§= < But e > In -1 / 9 n and, therefore, the latter bound 
is at most 8e 1//2 , as required to satisfy the third condition. 

Acknowledgment We would like to thank Colin McDiarmid for suggesting the use of his 
concentration inequality (Theorem 2]), which greatly facilitated our proofs. 
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